Dataset statistics
| Number of variables | 15 |
|---|---|
| Number of observations | 8659 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 328 |
| Duplicate rows (%) | 3.8% |
| Total size in memory | 896.5 KiB |
| Average record size in memory | 106.0 B |
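The percentage and per-record figures in this table follow directly from the raw counts. A minimal sketch of that arithmetic, assuming the memory size is reported in KiB of 1024 bytes (the variable names are illustrative only):

```python
# Figures copied from the "Dataset statistics" table above.
n_rows = 8659
n_duplicate_rows = 328
total_size_kib = 896.5  # assumption: 1 KiB = 1024 bytes

duplicate_pct = 100 * n_duplicate_rows / n_rows
avg_record_bytes = total_size_kib * 1024 / n_rows

print(f"Duplicate rows (%): {duplicate_pct:.1f}%")
print(f"Average record size in memory: {avg_record_bytes:.1f} B")
```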
Variable types
| Boolean | 2 |
|---|---|
| Categorical | 5 |
| Numeric | 8 |
Alerts
| Dataset has 328 (3.8%) duplicate rows | Duplicates |
|---|---|
| Cabin_deck is highly overall correlated with HomePlanet | High correlation |
| Consumption_Basic is highly overall correlated with Consumption_High_End and 4 other fields | High correlation |
| Consumption_High_End is highly overall correlated with Consumption_Basic and 4 other fields | High correlation |
| CryoSleep is highly overall correlated with FoodCourt and 4 other fields | High correlation |
| FoodCourt is highly overall correlated with Consumption_Basic and 3 other fields | High correlation |
| HomePlanet is highly overall correlated with Cabin_deck | High correlation |
| RoomService is highly overall correlated with Consumption_High_End and 1 other field | High correlation |
| ShoppingMall is highly overall correlated with Consumption_Basic and 1 other field | High correlation |
| Spa is highly overall correlated with Consumption_Basic and 2 other fields | High correlation |
| VRDeck is highly overall correlated with Consumption_Basic and 3 other fields | High correlation |
| VIP is highly imbalanced (84.5%) | Imbalance |
| RoomService has 5645 (65.2%) zeros | Zeros |
| FoodCourt has 5541 (64.0%) zeros | Zeros |
| ShoppingMall has 5684 (65.6%) zeros | Zeros |
| Spa has 5398 (62.3%) zeros | Zeros |
| VRDeck has 5583 (64.5%) zeros | Zeros |
| Consumption_High_End has 3805 (43.9%) zeros | Zeros |
| Consumption_Basic has 4164 (48.1%) zeros | Zeros |
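The 84.5% imbalance reported for VIP is consistent with one minus the normalized Shannon entropy of the class frequencies. The sketch below assumes that definition: it reproduces the reported value from the VIP counts, but the report itself does not state the formula, and `imbalance_score` is a hypothetical helper name.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / log(k), where H is the Shannon entropy of the class
    frequencies and k is the number of classes (0 = perfectly balanced)."""
    k = len(counts)
    if k < 2:
        raise ValueError("need at least two classes")
    total = sum(counts)
    entropy = -sum((c / total) * math.log(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log(k)

vip = imbalance_score([8464, 195])           # VIP: False / True counts
transported = imbalance_score([4375, 4284])  # Transported: 1 / 0 counts

print(f"VIP imbalance: {vip:.1%}")
print(f"Transported imbalance: {transported:.1%}")
```

Under this assumed definition the nearly even Transported split scores close to zero, which would explain why only VIP is flagged.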
Reproduction
| Analysis started | 2024-05-07 12:02:54.792678 |
|---|---|
| Analysis finished | 2024-05-07 12:03:13.903874 |
| Duration | 19.11 seconds |
| Software version | ydata-profiling v4.7.0 |
| Download configuration | config.json |
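The reported duration can be checked against the two timestamps. A small sketch, assuming both timestamps share the same clock and time zone:

```python
from datetime import datetime

FMT = "%Y-%m-%d %H:%M:%S.%f"
started = datetime.strptime("2024-05-07 12:02:54.792678", FMT)
finished = datetime.strptime("2024-05-07 12:03:13.903874", FMT)

duration_s = (finished - started).total_seconds()
print(f"Duration: {duration_s:.2f} seconds")
```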
CryoSleep
Boolean
HIGH CORRELATION 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 8.6 KiB |
| Value | Count | Frequency (%) |
|---|---|---|
| False | 5537 | 63.9% |
| True | 3122 | 36.1% |
Destination
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 67.8 KiB |
Length
| Max length | 13 |
|---|---|
| Median length | 11 |
| Mean length | 11.183624 |
| Min length | 11 |
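The length statistics can be reproduced from the three category counts listed under Common Values. A minimal sketch (names are illustrative):

```python
from statistics import median

# Category counts from the Destination "Common Values" table.
counts = {"TRAPPIST-1e": 6071, "55 Cancri e": 1793, "PSO J318.5-22": 795}

n = sum(counts.values())
total_chars = sum(len(value) * c for value, c in counts.items())
# Expand to one length per observation to take the median.
lengths = [len(value) for value, c in counts.items() for _ in range(c)]

print("Total characters:", total_chars)
print("Mean length:", round(total_chars / n, 6))
print("Median length:", median(lengths))
print("Min / max length:", min(lengths), max(lengths))
```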
Characters and Unicode
| Total characters | 96839 |
|---|---|
| Distinct characters | 23 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | TRAPPIST-1e |
|---|---|
| 2nd row | TRAPPIST-1e |
| 3rd row | TRAPPIST-1e |
| 4th row | TRAPPIST-1e |
| 5th row | TRAPPIST-1e |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| TRAPPIST-1e | 6071 | 70.1% |
| 55 Cancri e | 1793 | 20.7% |
| PSO J318.5-22 | 795 | 9.2% |
[Plots omitted: Length, Common Values (Plot)]
Words
| Value | Count | Frequency (%) |
|---|---|---|
| trappist-1e | 6071 | 46.6% |
| 55 | 1793 | 13.8% |
| cancri | 1793 | 13.8% |
| e | 1793 | 13.8% |
| pso | 795 | 6.1% |
| j318.5-22 | 795 | 6.1% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| P | 12937 | 13.4% |
| T | 12142 | 12.5% |
| e | 7864 | 8.1% |
| S | 6866 | 7.1% |
| - | 6866 | 7.1% |
| 1 | 6866 | 7.1% |
| A | 6071 | 6.3% |
| I | 6071 | 6.3% |
| R | 6071 | 6.3% |
| 5 | 4381 | 4.5% |
| Other values (13) | 20704 | 21.4% |
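The word and character tables for Destination can be rebuilt from the category counts. The sketch below assumes words are lower-cased, whitespace-separated tokens and that characters are counted case-sensitively with spaces included; both assumptions are inferred from the tables, not stated in the report.

```python
from collections import Counter

# Category counts from the Destination "Common Values" table.
counts = {"TRAPPIST-1e": 6071, "55 Cancri e": 1793, "PSO J318.5-22": 795}

word_counts = Counter()
char_counts = Counter()
for value, c in counts.items():
    # Assumption: words are lower-cased tokens split on whitespace.
    for word in value.lower().split():
        word_counts[word] += c
    # Assumption: characters are case-sensitive and include spaces.
    for ch in value:
        char_counts[ch] += c

total_words = sum(word_counts.values())
total_chars = sum(char_counts.values())

print(word_counts.most_common(3))
print(char_counts.most_common(4))
print(f"cancri share of words: {100 * word_counts['cancri'] / total_words:.1f}%")
```

Because frequencies in the word table are relative to the total number of tokens rather than the number of rows, "55 Cancri e" contributes three tokens per row, which is why its share drops from 20.7% of rows to 13.8% of words.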
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 96839 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 96839 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 96839 |
VIP
Boolean
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 8.6 KiB |
| Value | Count | Frequency (%) |
|---|---|---|
| False | 8464 | 97.7% |
| True | 195 | 2.3% |
RoomService
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 1361 |
|---|---|
| Distinct (%) | 15.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 221.63415 |
| Minimum | 0 |
| Maximum | 14327 |
| Zeros | 5645 |
| Zeros (%) | 65.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 67.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 55 |
| 95-th percentile | 1256.2 |
| Maximum | 14327 |
| Range | 14327 |
| Interquartile range (IQR) | 55 |
Descriptive statistics
| Standard deviation | 648.26018 |
|---|---|
| Coefficient of variation (CV) | 2.924911 |
| Kurtosis | 67.75414 |
| Mean | 221.63415 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 6.3564064 |
| Sum | 1919130.1 |
| Variance | 420241.26 |
| Monotonicity | Not monotonic |
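Several of the rows above are functions of one another, so they can be cross-checked. The MAD of 0 also follows from the zero counts: the median is 0 and more than half of the values equal it. A short sketch using the RoomService figures (tolerances allow for the rounding of the printed values):

```python
# Values copied from the RoomService tables.
n = 8659
mean = 221.63415
std = 648.26018
q1, q3 = 0, 55
minimum, maximum = 0, 14327

cv = std / mean                  # coefficient of variation
variance = std ** 2
total = mean * n                 # "Sum" row
iqr = q3 - q1                    # interquartile range
value_range = maximum - minimum

print(f"CV={cv:.6f}  variance={variance:.2f}  sum={total:.1f}")
print(f"IQR={iqr}  range={value_range}")
```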
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 5645 | 65.2% |
| 1 | 117 | 1.4% |
| 2 | 78 | 0.9% |
| 3 | 61 | 0.7% |
| 4 | 47 | 0.5% |
| 5 | 28 | 0.3% |
| 9 | 25 | 0.3% |
| 8 | 24 | 0.3% |
| 6 | 24 | 0.3% |
| 14 | 21 | 0.2% |
| Other values (1351) | 2589 | 29.9% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 5645 | 65.2% |
| 1 | 117 | 1.4% |
| 1.077766015 | 1 | < 0.1% |
| 2 | 78 | 0.9% |
| 3 | 61 | 0.7% |
| 4 | 47 | 0.5% |
| 5 | 28 | 0.3% |
| 6 | 24 | 0.3% |
| 7 | 17 | 0.2% |
| 8 | 24 | 0.3% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 14327 | 1 | < 0.1% |
| 9920 | 1 | < 0.1% |
| 8586 | 1 | < 0.1% |
| 8243 | 1 | < 0.1% |
| 8209 | 1 | < 0.1% |
| 8168 | 1 | < 0.1% |
| 8142 | 1 | < 0.1% |
| 8030 | 1 | < 0.1% |
| 7406 | 1 | < 0.1% |
| 7172 | 1 | < 0.1% |
FoodCourt
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 1577 |
|---|---|
| Distinct (%) | 18.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 427.79642 |
| Minimum | 0 |
| Maximum | 29813 |
| Zeros | 5541 |
| Zeros (%) | 64.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 67.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 81.5 |
| 95-th percentile | 2561.6 |
| Maximum | 29813 |
| Range | 29813 |
| Interquartile range (IQR) | 81.5 |
Descriptive statistics
| Standard deviation | 1502.8659 |
|---|---|
| Coefficient of variation (CV) | 3.5130398 |
| Kurtosis | 85.968892 |
| Mean | 427.79642 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 7.5336215 |
| Sum | 3704289.2 |
| Variance | 2258605.9 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 5541 | 64.0% |
| 1 | 116 | 1.3% |
| 2 | 75 | 0.9% |
| 4 | 53 | 0.6% |
| 3 | 53 | 0.6% |
| 5 | 33 | 0.4% |
| 6 | 31 | 0.4% |
| 9 | 28 | 0.3% |
| 7 | 27 | 0.3% |
| 10 | 27 | 0.3% |
| Other values (1567) | 2675 | 30.9% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 5541 | 64.0% |
| 1 | 116 | 1.3% |
| 2 | 75 | 0.9% |
| 3 | 53 | 0.6% |
| 4 | 53 | 0.6% |
| 5 | 33 | 0.4% |
| 6 | 31 | 0.4% |
| 7 | 27 | 0.3% |
| 8 | 20 | 0.2% |
| 9 | 28 | 0.3% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 29813 | 1 | < 0.1% |
| 27723 | 1 | < 0.1% |
| 27071 | 1 | < 0.1% |
| 26830 | 1 | < 0.1% |
| 18481 | 1 | < 0.1% |
| 17958 | 1 | < 0.1% |
| 17901 | 1 | < 0.1% |
| 17687 | 1 | < 0.1% |
| 17432 | 1 | < 0.1% |
| 17394 | 1 | < 0.1% |
ShoppingMall
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 1201 |
|---|---|
| Distinct (%) | 13.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 169.56456 |
| Minimum | 0 |
| Maximum | 23492 |
| Zeros | 5684 |
| Zeros (%) | 65.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 67.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 30 |
| 95-th percentile | 921 |
| Maximum | 23492 |
| Range | 23492 |
| Interquartile range (IQR) | 30 |
Descriptive statistics
| Standard deviation | 573.09382 |
|---|---|
| Coefficient of variation (CV) | 3.3797971 |
| Kurtosis | 373.75265 |
| Mean | 169.56456 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 13.003727 |
| Sum | 1468259.6 |
| Variance | 328436.53 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 5684 | 65.6% |
| 1 | 152 | 1.8% |
| 2 | 80 | 0.9% |
| 3 | 59 | 0.7% |
| 4 | 45 | 0.5% |
| 5 | 38 | 0.4% |
| 7 | 35 | 0.4% |
| 6 | 34 | 0.4% |
| 13 | 29 | 0.3% |
| 9 | 28 | 0.3% |
| Other values (1191) | 2475 | 28.6% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 5684 | 65.6% |
| 1 | 152 | 1.8% |
| 2 | 80 | 0.9% |
| 3 | 59 | 0.7% |
| 4 | 45 | 0.5% |
| 5 | 38 | 0.4% |
| 6 | 34 | 0.4% |
| 7 | 35 | 0.4% |
| 8 | 28 | 0.3% |
| 9 | 28 | 0.3% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 23492 | 1 | < 0.1% |
| 12253 | 1 | < 0.1% |
| 9058 | 1 | < 0.1% |
| 7810 | 1 | < 0.1% |
| 7185 | 1 | < 0.1% |
| 7148 | 1 | < 0.1% |
| 7104 | 1 | < 0.1% |
| 6805 | 1 | < 0.1% |
| 6331 | 1 | < 0.1% |
| 6221 | 1 | < 0.1% |
Spa
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 1404 |
|---|---|
| Distinct (%) | 16.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 284.87394 |
| Minimum | 0 |
| Maximum | 16139 |
| Zeros | 5398 |
| Zeros (%) | 62.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 67.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 64 |
| 95-th percentile | 1520.5 |
| Maximum | 16139 |
| Range | 16139 |
| Interquartile range (IQR) | 64 |
Descriptive statistics
| Standard deviation | 983.61954 |
|---|---|
| Coefficient of variation (CV) | 3.4528239 |
| Kurtosis | 66.755155 |
| Mean | 284.87394 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 6.943894 |
| Sum | 2466723.4 |
| Variance | 967507.41 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 5398 | 62.3% |
| 1 | 146 | 1.7% |
| 2 | 105 | 1.2% |
| 5 | 53 | 0.6% |
| 3 | 53 | 0.6% |
| 4 | 46 | 0.5% |
| 7 | 34 | 0.4% |
| 6 | 33 | 0.4% |
| 9 | 28 | 0.3% |
| 8 | 28 | 0.3% |
| Other values (1394) | 2735 | 31.6% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 5398 | 62.3% |
| 1 | 146 | 1.7% |
| 2 | 105 | 1.2% |
| 3 | 53 | 0.6% |
| 4 | 46 | 0.5% |
| 4.560684576 | 1 | < 0.1% |
| 5 | 53 | 0.6% |
| 5.642490463 | 1 | < 0.1% |
| 6 | 33 | 0.4% |
| 7 | 34 | 0.4% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 16139 | 1 | < 0.1% |
| 15586 | 1 | < 0.1% |
| 15331 | 1 | < 0.1% |
| 15238 | 1 | < 0.1% |
| 13995 | 1 | < 0.1% |
| 13104 | 1 | < 0.1% |
| 12062 | 1 | < 0.1% |
| 11001 | 1 | < 0.1% |
| 10976 | 1 | < 0.1% |
| 10941 | 1 | < 0.1% |
VRDeck
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 1381 |
|---|---|
| Distinct (%) | 15.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 285.62515 |
| Minimum | 0 |
| Maximum | 24133 |
| Zeros | 5583 |
| Zeros (%) | 64.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 67.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 49.5 |
| 95-th percentile | 1457.2 |
| Maximum | 24133 |
| Range | 24133 |
| Interquartile range (IQR) | 49.5 |
Descriptive statistics
| Standard deviation | 1056.8989 |
|---|---|
| Coefficient of variation (CV) | 3.7003007 |
| Kurtosis | 97.349881 |
| Mean | 285.62515 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 8.0971526 |
| Sum | 2473228.2 |
| Variance | 1117035.4 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 5583 | 64.5% |
| 1 | 138 | 1.6% |
| 2 | 70 | 0.8% |
| 3 | 56 | 0.6% |
| 5 | 51 | 0.6% |
| 4 | 47 | 0.5% |
| 6 | 32 | 0.4% |
| 8 | 30 | 0.3% |
| 7 | 29 | 0.3% |
| 9 | 25 | 0.3% |
| Other values (1371) | 2598 | 30.0% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 5583 | 64.5% |
| 1 | 138 | 1.6% |
| 2 | 70 | 0.8% |
| 3 | 56 | 0.6% |
| 4 | 47 | 0.5% |
| 5 | 51 | 0.6% |
| 6 | 32 | 0.4% |
| 7 | 29 | 0.3% |
| 8 | 30 | 0.3% |
| 9 | 25 | 0.3% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 24133 | 1 | < 0.1% |
| 20336 | 1 | < 0.1% |
| 17074 | 1 | < 0.1% |
| 16337 | 1 | < 0.1% |
| 12708 | 1 | < 0.1% |
| 12682 | 1 | < 0.1% |
| 12424 | 1 | < 0.1% |
| 12392 | 1 | < 0.1% |
| 12323 | 1 | < 0.1% |
| 12143 | 1 | < 0.1% |
Cabin_deck
Categorical
HIGH CORRELATION 
| Distinct | 8 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 67.8 KiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 8659 |
|---|---|
| Distinct characters | 8 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | B |
|---|---|
| 2nd row | F |
| 3rd row | A |
| 4th row | A |
| 5th row | F |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| F | 2868 | 33.1% |
| G | 2615 | 30.2% |
| E | 875 | 10.1% |
| B | 804 | 9.3% |
| C | 752 | 8.7% |
| D | 483 | 5.6% |
| A | 257 | 3.0% |
| T | 5 | 0.1% |
[Plots omitted: Length, Common Values (Plot)]
Words
| Value | Count | Frequency (%) |
|---|---|---|
| f | 2868 | 33.1% |
| g | 2615 | 30.2% |
| e | 875 | 10.1% |
| b | 804 | 9.3% |
| c | 752 | 8.7% |
| d | 483 | 5.6% |
| a | 257 | 3.0% |
| t | 5 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| F | 2868 | 33.1% |
| G | 2615 | 30.2% |
| E | 875 | 10.1% |
| B | 804 | 9.3% |
| C | 752 | 8.7% |
| D | 483 | 5.6% |
| A | 257 | 3.0% |
| T | 5 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 8659 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 8659 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 8659 |
Group_size
Real number (ℝ)
| Distinct | 8 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.0349925 |
| Minimum | 1 |
| Maximum | 8 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 67.8 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 3 |
| 95-th percentile | 6 |
| Maximum | 8 |
| Range | 7 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.5964559 |
|---|---|
| Coefficient of variation (CV) | 0.78450211 |
| Kurtosis | 3.1734359 |
| Mean | 2.0349925 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.8903565 |
| Sum | 17621 |
| Variance | 2.5486714 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 4790 | 55.3% |
| 2 | 1671 | 19.3% |
| 3 | 1018 | 11.8% |
| 4 | 410 | 4.7% |
| 5 | 263 | 3.0% |
| 7 | 230 | 2.7% |
| 6 | 173 | 2.0% |
| 8 | 104 | 1.2% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 4790 | 55.3% |
| 2 | 1671 | 19.3% |
| 3 | 1018 | 11.8% |
| 4 | 410 | 4.7% |
| 5 | 263 | 3.0% |
| 6 | 173 | 2.0% |
| 7 | 230 | 2.7% |
| 8 | 104 | 1.2% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 8 | 104 | 1.2% |
| 7 | 230 | 2.7% |
| 6 | 173 | 2.0% |
| 5 | 263 | 3.0% |
| 4 | 410 | 4.7% |
| 3 | 1018 | 11.8% |
| 2 | 1671 | 19.3% |
| 1 | 4790 | 55.3% |
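Because Group_size takes only eight distinct values, its frequency table is enough to recompute the descriptive statistics. A minimal sketch, assuming the reported variance uses the sample (n - 1) denominator, which is the choice that matches the printed figure:

```python
import math

# Group_size frequency table: value -> count.
freq = {1: 4790, 2: 1671, 3: 1018, 4: 410, 5: 263, 6: 173, 7: 230, 8: 104}

n = sum(freq.values())
total = sum(v * c for v, c in freq.items())
mean = total / n
# Sample variance (n - 1 denominator); an assumption that matches the report.
variance = sum(c * (v - mean) ** 2 for v, c in freq.items()) / (n - 1)
std = math.sqrt(variance)

print(f"n={n}  sum={total}  mean={mean:.7f}")
print(f"variance={variance:.7f}  std={std:.7f}  CV={std / mean:.8f}")
```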
HomePlanet
Categorical
HIGH CORRELATION 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 67.8 KiB |
Length
| Max length | 6 |
|---|---|
| Median length | 5 |
| Mean length | 5.0383416 |
| Min length | 4 |
Characters and Unicode
| Total characters | 43627 |
|---|---|
| Distinct characters | 10 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Europa |
|---|---|
| 2nd row | Earth |
| 3rd row | Europa |
| 4th row | Europa |
| 5th row | Earth |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| Earth | 4709 | 54.4% |
| Europa | 2141 | 24.7% |
| Mars | 1809 | 20.9% |
[Plots omitted: Length, Common Values (Plot)]
Words
| Value | Count | Frequency (%) |
|---|---|---|
| earth | 4709 | 54.4% |
| europa | 2141 | 24.7% |
| mars | 1809 | 20.9% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| a | 8659 | 19.8% |
| r | 8659 | 19.8% |
| E | 6850 | 15.7% |
| t | 4709 | 10.8% |
| h | 4709 | 10.8% |
| u | 2141 | 4.9% |
| o | 2141 | 4.9% |
| p | 2141 | 4.9% |
| M | 1809 | 4.1% |
| s | 1809 | 4.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 43627 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 43627 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 43627 |
Transported
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 67.8 KiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 8659 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 1 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 4375 | 50.5% |
| 0 | 4284 | 49.5% |
[Plots omitted: Length, Common Values (Plot)]
Words
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 4375 | 50.5% |
| 0 | 4284 | 49.5% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 4375 | 50.5% |
| 0 | 4284 | 49.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 8659 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 8659 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 8659 |
Consumption_High_End
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 2540 |
|---|---|
| Distinct (%) | 29.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 792.13324 |
| Minimum | 0 |
| Maximum | 25463.229 |
| Zeros | 3805 |
| Zeros (%) | 43.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 67.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 80 |
| Q3 | 857 |
| 95-th percentile | 3660 |
| Maximum | 25463.229 |
| Range | 25463.229 |
| Interquartile range (IQR) | 857 |
Descriptive statistics
| Standard deviation | 1643.9217 |
|---|---|
| Coefficient of variation (CV) | 2.0753096 |
| Kurtosis | 29.791174 |
| Mean | 792.13324 |
| Median Absolute Deviation (MAD) | 80 |
| Skewness | 4.5050317 |
| Sum | 6859081.7 |
| Variance | 2702478.6 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 3805 | 43.9% |
| 1 | 26 | 0.3% |
| 2 | 23 | 0.3% |
| 4 | 22 | 0.3% |
| 3 | 20 | 0.2% |
| 5 | 18 | 0.2% |
| 7 | 16 | 0.2% |
| 804 | 15 | 0.2% |
| 11 | 15 | 0.2% |
| 6 | 13 | 0.2% |
| Other values (2530) | 4686 | 54.1% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 3805 | 43.9% |
| 1 | 26 | 0.3% |
| 1.077766015 | 1 | < 0.1% |
| 2 | 23 | 0.3% |
| 3 | 20 | 0.2% |
| 4 | 22 | 0.3% |
| 5 | 18 | 0.2% |
| 6 | 13 | 0.2% |
| 7 | 16 | 0.2% |
| 7.395469138 | 1 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 25463.22895 | 1 | < 0.1% |
| 20961 | 1 | < 0.1% |
| 18037 | 1 | < 0.1% |
| 17928 | 1 | < 0.1% |
| 16826 | 1 | < 0.1% |
| 16762 | 1 | < 0.1% |
| 16394 | 1 | < 0.1% |
| 16059 | 1 | < 0.1% |
| 15758 | 1 | < 0.1% |
| 14695 | 1 | < 0.1% |
Consumption_Basic
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 2110 |
|---|---|
| Distinct (%) | 24.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 597.36099 |
| Minimum | 0 |
| Maximum | 29813 |
| Zeros | 4164 |
| Zeros (%) | 48.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 67.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 3 |
| Q3 | 608 |
| 95-th percentile | 3070.6 |
| Maximum | 29813 |
| Range | 29813 |
| Interquartile range (IQR) | 608 |
Descriptive statistics
| Standard deviation | 1599.5818 |
|---|---|
| Coefficient of variation (CV) | 2.6777474 |
| Kurtosis | 71.155004 |
| Mean | 597.36099 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 6.7157102 |
| Sum | 5172548.8 |
| Variance | 2558662 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 4164 | 48.1% |
| 1 | 98 | 1.1% |
| 2 | 53 | 0.6% |
| 3 | 41 | 0.5% |
| 5 | 40 | 0.5% |
| 4 | 38 | 0.4% |
| 6 | 32 | 0.4% |
| 10 | 30 | 0.3% |
| 13 | 29 | 0.3% |
| 7 | 28 | 0.3% |
| Other values (2100) | 4106 | 47.4% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 4164 | 48.1% |
| 1 | 98 | 1.1% |
| 2 | 53 | 0.6% |
| 3 | 41 | 0.5% |
| 4 | 38 | 0.4% |
| 5 | 40 | 0.5% |
| 6 | 32 | 0.4% |
| 7 | 28 | 0.3% |
| 8 | 17 | 0.2% |
| 9 | 26 | 0.3% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 29813 | 1 | < 0.1% |
| 27726 | 1 | < 0.1% |
| 27071 | 1 | < 0.1% |
| 26830 | 1 | < 0.1% |
| 23858 | 1 | < 0.1% |
| 18481 | 1 | < 0.1% |
| 18057 | 1 | < 0.1% |
| 17901 | 1 | < 0.1% |
| 17687 | 1 | < 0.1% |
| 17432 | 1 | < 0.1% |
Age_group
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 67.8 KiB |
Length
| Max length | 12 |
|---|---|
| Median length | 12 |
| Mean length | 10.389075 |
| Min length | 5 |
Characters and Unicode
| Total characters | 89959 |
|---|---|
| Distinct characters | 17 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Young adults |
|---|---|
| 2nd row | Young adults |
| 3rd row | Middle-aged |
| 4th row | Young adults |
| 5th row | Minor |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| Young adults | 5260 | 60.7% |
| Middle-aged | 1599 | 18.5% |
| Minor | 1550 | 17.9% |
| Senior | 250 | 2.9% |
[Plots omitted: Length, Common Values (Plot)]
Words
| Value | Count | Frequency (%) |
|---|---|---|
| young | 5260 | 37.8% |
| adults | 5260 | 37.8% |
| middle-aged | 1599 | 11.5% |
| minor | 1550 | 11.1% |
| senior | 250 | 1.8% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| u | 10520 | 11.7% |
| d | 10057 | 11.2% |
| n | 7060 | 7.8% |
| o | 7060 | 7.8% |
| l | 6859 | 7.6% |
| g | 6859 | 7.6% |
| a | 6859 | 7.6% |
| t | 5260 | 5.8% |
| s | 5260 | 5.8% |
| Y | 5260 | 5.8% |
| Other values (7) | 18905 | 21.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 89959 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 89959 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 89959 |
Correlations
| | Age_group | Cabin_deck | Consumption_Basic | Consumption_High_End | CryoSleep | Destination | FoodCourt | Group_size | HomePlanet | RoomService | ShoppingMall | Spa | Transported | VIP | VRDeck |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Age_group | 1.000 | 0.160 | 0.085 | 0.108 | 0.122 | 0.026 | 0.062 | -0.150 | 0.137 | 0.089 | 0.080 | 0.081 | 0.117 | 0.074 | 0.070 |
| Cabin_deck | 0.160 | 1.000 | -0.259 | -0.258 | 0.339 | 0.246 | -0.268 | -0.141 | 0.754 | -0.055 | -0.041 | -0.223 | 0.221 | 0.198 | -0.223 |
| Consumption_Basic | 0.085 | -0.259 | 1.000 | 0.632 | 0.173 | 0.086 | 0.772 | -0.110 | 0.261 | 0.402 | 0.653 | 0.522 | 0.093 | 0.141 | 0.500 |
| Consumption_High_End | 0.108 | -0.258 | 0.632 | 1.000 | 0.218 | 0.088 | 0.527 | -0.122 | 0.252 | 0.611 | 0.410 | 0.704 | 0.257 | 0.117 | 0.672 |
| CryoSleep | 0.122 | 0.339 | 0.173 | 0.218 | 1.000 | 0.122 | -0.545 | 0.101 | 0.124 | -0.532 | -0.527 | -0.563 | 0.466 | 0.078 | -0.540 |
| Destination | 0.026 | 0.246 | 0.086 | 0.088 | 0.122 | 1.000 | -0.017 | -0.035 | 0.262 | 0.102 | 0.092 | 0.024 | 0.113 | 0.045 | -0.008 |
| FoodCourt | 0.062 | -0.268 | 0.772 | 0.527 | -0.545 | -0.017 | 1.000 | -0.059 | 0.253 | 0.194 | 0.199 | 0.482 | 0.080 | 0.135 | 0.507 |
| Group_size | -0.150 | -0.141 | -0.110 | -0.122 | 0.101 | -0.035 | -0.059 | 1.000 | 0.241 | -0.144 | -0.142 | -0.078 | 0.127 | 0.044 | -0.077 |
| HomePlanet | 0.137 | 0.754 | 0.261 | 0.252 | 0.124 | 0.262 | 0.253 | 0.241 | 1.000 | 0.113 | 0.049 | -0.003 | 0.202 | 0.174 | -0.076 |
| RoomService | 0.089 | -0.055 | 0.402 | 0.611 | -0.532 | 0.102 | 0.194 | -0.144 | 0.113 | 1.000 | 0.447 | 0.259 | 0.161 | 0.052 | 0.188 |
| ShoppingMall | 0.080 | -0.041 | 0.653 | 0.410 | -0.527 | 0.092 | 0.199 | -0.142 | 0.049 | 0.447 | 1.000 | 0.267 | 0.034 | 0.007 | 0.208 |
| Spa | 0.081 | -0.223 | 0.522 | 0.704 | -0.563 | 0.024 | 0.482 | -0.078 | -0.003 | 0.259 | 0.267 | 1.000 | 0.186 | 0.078 | 0.443 |
| Transported | 0.117 | 0.221 | 0.093 | 0.257 | 0.466 | 0.113 | 0.080 | 0.127 | 0.202 | 0.161 | 0.034 | 0.186 | 1.000 | 0.033 | -0.352 |
| VIP | 0.074 | 0.198 | 0.141 | 0.117 | 0.078 | 0.045 | 0.135 | 0.044 | 0.174 | 0.052 | 0.007 | 0.078 | 0.033 | 1.000 | 0.095 |
| VRDeck | 0.070 | -0.223 | 0.500 | 0.672 | -0.540 | -0.008 | 0.507 | -0.077 | -0.076 | 0.188 | 0.208 | 0.443 | -0.352 | 0.095 | 1.000 |
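The "High correlation" alerts can be cross-checked against this matrix. The sketch below assumes an alert is raised whenever the absolute coefficient is at least 0.5; that cut-off is not stated in the report, but it reproduces every alert listed (including the VRDeck and Consumption_Basic pair, shown rounded as 0.500). `high_correlations` is a hypothetical helper name.

```python
names = ["Age_group", "Cabin_deck", "Consumption_Basic", "Consumption_High_End",
         "CryoSleep", "Destination", "FoodCourt", "Group_size", "HomePlanet",
         "RoomService", "ShoppingMall", "Spa", "Transported", "VIP", "VRDeck"]

# Rows copied from the correlation table, in the same order as `names`.
matrix = [
    [1.000, 0.160, 0.085, 0.108, 0.122, 0.026, 0.062, -0.150, 0.137, 0.089, 0.080, 0.081, 0.117, 0.074, 0.070],
    [0.160, 1.000, -0.259, -0.258, 0.339, 0.246, -0.268, -0.141, 0.754, -0.055, -0.041, -0.223, 0.221, 0.198, -0.223],
    [0.085, -0.259, 1.000, 0.632, 0.173, 0.086, 0.772, -0.110, 0.261, 0.402, 0.653, 0.522, 0.093, 0.141, 0.500],
    [0.108, -0.258, 0.632, 1.000, 0.218, 0.088, 0.527, -0.122, 0.252, 0.611, 0.410, 0.704, 0.257, 0.117, 0.672],
    [0.122, 0.339, 0.173, 0.218, 1.000, 0.122, -0.545, 0.101, 0.124, -0.532, -0.527, -0.563, 0.466, 0.078, -0.540],
    [0.026, 0.246, 0.086, 0.088, 0.122, 1.000, -0.017, -0.035, 0.262, 0.102, 0.092, 0.024, 0.113, 0.045, -0.008],
    [0.062, -0.268, 0.772, 0.527, -0.545, -0.017, 1.000, -0.059, 0.253, 0.194, 0.199, 0.482, 0.080, 0.135, 0.507],
    [-0.150, -0.141, -0.110, -0.122, 0.101, -0.035, -0.059, 1.000, 0.241, -0.144, -0.142, -0.078, 0.127, 0.044, -0.077],
    [0.137, 0.754, 0.261, 0.252, 0.124, 0.262, 0.253, 0.241, 1.000, 0.113, 0.049, -0.003, 0.202, 0.174, -0.076],
    [0.089, -0.055, 0.402, 0.611, -0.532, 0.102, 0.194, -0.144, 0.113, 1.000, 0.447, 0.259, 0.161, 0.052, 0.188],
    [0.080, -0.041, 0.653, 0.410, -0.527, 0.092, 0.199, -0.142, 0.049, 0.447, 1.000, 0.267, 0.034, 0.007, 0.208],
    [0.081, -0.223, 0.522, 0.704, -0.563, 0.024, 0.482, -0.078, -0.003, 0.259, 0.267, 1.000, 0.186, 0.078, 0.443],
    [0.117, 0.221, 0.093, 0.257, 0.466, 0.113, 0.080, 0.127, 0.202, 0.161, 0.034, 0.186, 1.000, 0.033, -0.352],
    [0.074, 0.198, 0.141, 0.117, 0.078, 0.045, 0.135, 0.044, 0.174, 0.052, 0.007, 0.078, 0.033, 1.000, 0.095],
    [0.070, -0.223, 0.500, 0.672, -0.540, -0.008, 0.507, -0.077, -0.076, 0.188, 0.208, 0.443, -0.352, 0.095, 1.000],
]

THRESHOLD = 0.5  # assumed cut-off; not stated in the report

def high_correlations(names, matrix, threshold):
    """Map each variable to the other variables with |r| >= threshold."""
    result = {}
    for i, name in enumerate(names):
        partners = [names[j] for j, r in enumerate(matrix[i])
                    if j != i and abs(r) >= threshold]
        if partners:
            result[name] = partners
    return result

alerts = high_correlations(names, matrix, THRESHOLD)
for name, partners in alerts.items():
    print(f"{name}: {', '.join(partners)}")
```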
First rows
| | CryoSleep | Destination | VIP | RoomService | FoodCourt | ShoppingMall | Spa | VRDeck | Cabin_deck | Group_size | HomePlanet | Transported | Consumption_High_End | Consumption_Basic | Age_group |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | False | TRAPPIST-1e | False | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | B | 1 | Europa | 0 | 0.0 | 0.0 | Young adults |
| 1 | False | TRAPPIST-1e | False | 109.0 | 9.0 | 25.0 | 549.0 | 44.0 | F | 1 | Earth | 1 | 702.0 | 34.0 | Young adults |
| 2 | False | TRAPPIST-1e | True | 43.0 | 3576.0 | 0.0 | 6715.0 | 49.0 | A | 2 | Europa | 0 | 6807.0 | 3576.0 | Middle-aged |
| 3 | False | TRAPPIST-1e | False | 0.0 | 1283.0 | 371.0 | 3329.0 | 193.0 | A | 2 | Europa | 0 | 3522.0 | 1654.0 | Young adults |
| 4 | False | TRAPPIST-1e | False | 303.0 | 70.0 | 151.0 | 565.0 | 2.0 | F | 1 | Earth | 1 | 870.0 | 221.0 | Minor |
| 5 | False | PSO J318.5-22 | False | 0.0 | 483.0 | 0.0 | 291.0 | 0.0 | F | 1 | Earth | 1 | 291.0 | 483.0 | Middle-aged |
| 6 | False | TRAPPIST-1e | False | 42.0 | 1539.0 | 3.0 | 0.0 | 0.0 | F | 2 | Earth | 1 | 42.0 | 1542.0 | Young adults |
| 7 | True | TRAPPIST-1e | False | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | G | 2 | Earth | 1 | 0.0 | 0.0 | Young adults |
| 8 | False | TRAPPIST-1e | False | 0.0 | 785.0 | 17.0 | 216.0 | 0.0 | F | 1 | Earth | 1 | 216.0 | 802.0 | Young adults |
| 9 | True | 55 Cancri e | False | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | B | 3 | Europa | 1 | 0.0 | 0.0 | Minor |
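In these rows the two Consumption columns look like sums of the individual spending columns: Consumption_High_End equals RoomService + Spa + VRDeck, and Consumption_Basic equals FoodCourt + ShoppingMall. This relation is inferred from the sample only and is not stated by the report; the sketch below simply checks it on the non-zero rows shown above.

```python
# (RoomService, FoodCourt, ShoppingMall, Spa, VRDeck,
#  Consumption_High_End, Consumption_Basic) for rows 1-6 and 8 above.
rows = [
    (109.0, 9.0, 25.0, 549.0, 44.0, 702.0, 34.0),
    (43.0, 3576.0, 0.0, 6715.0, 49.0, 6807.0, 3576.0),
    (0.0, 1283.0, 371.0, 3329.0, 193.0, 3522.0, 1654.0),
    (303.0, 70.0, 151.0, 565.0, 2.0, 870.0, 221.0),
    (0.0, 483.0, 0.0, 291.0, 0.0, 291.0, 483.0),
    (42.0, 1539.0, 3.0, 0.0, 0.0, 42.0, 1542.0),
    (0.0, 785.0, 17.0, 216.0, 0.0, 216.0, 802.0),
]

# Assumed (inferred) definitions of the two derived columns.
all_consistent = all(
    high_end == room + spa + vr and basic == food + mall
    for room, food, mall, spa, vr, high_end, basic in rows
)
print("Sample rows consistent with the assumed sums:", all_consistent)
```

If the relation holds for the whole dataset, it would account for the high correlations between the Consumption columns and their component columns.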
Last rows
| | CryoSleep | Destination | VIP | RoomService | FoodCourt | ShoppingMall | Spa | VRDeck | Cabin_deck | Group_size | HomePlanet | Transported | Consumption_High_End | Consumption_Basic | Age_group |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 8649 | False | TRAPPIST-1e | False | 86.0 | 3.0 | 149.0 | 208.0 | 329.0 | F | 2 | Earth | 0 | 623.0 | 152.0 | Young adults |
| 8650 | True | TRAPPIST-1e | False | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | G | 1 | Earth | 1 | 0.0 | 0.0 | Young adults |
| 8651 | False | TRAPPIST-1e | False | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | A | 3 | Europa | 1 | 0.0 | 0.0 | Minor |
| 8652 | False | TRAPPIST-1e | False | 1.0 | 1146.0 | 0.0 | 50.0 | 34.0 | A | 3 | Europa | 0 | 85.0 | 1146.0 | Young adults |
| 8653 | False | TRAPPIST-1e | False | 0.0 | 3208.0 | 0.0 | 2.0 | 330.0 | A | 3 | Europa | 1 | 332.0 | 3208.0 | Young adults |
| 8654 | False | 55 Cancri e | True | 0.0 | 6819.0 | 0.0 | 1643.0 | 74.0 | A | 1 | Europa | 0 | 1717.0 | 6819.0 | Middle-aged |
| 8655 | True | PSO J318.5-22 | False | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | G | 1 | Earth | 0 | 0.0 | 0.0 | Young adults |
| 8656 | False | TRAPPIST-1e | False | 0.0 | 0.0 | 1872.0 | 1.0 | 0.0 | G | 1 | Earth | 1 | 1.0 | 1872.0 | Young adults |
| 8657 | False | 55 Cancri e | False | 0.0 | 1049.0 | 0.0 | 353.0 | 3235.0 | E | 2 | Europa | 0 | 3588.0 | 1049.0 | Young adults |
| 8658 | False | TRAPPIST-1e | False | 126.0 | 4688.0 | 0.0 | 0.0 | 12.0 | E | 2 | Europa | 1 | 138.0 | 4688.0 | Middle-aged |
Most frequently occurring
| | CryoSleep | Destination | VIP | RoomService | FoodCourt | ShoppingMall | Spa | VRDeck | Cabin_deck | Group_size | HomePlanet | Transported | Consumption_High_End | Consumption_Basic | Age_group | # duplicates |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 293 | True | TRAPPIST-1e | False | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | G | 1 | Earth | 1 | 0.0 | 0.0 | Young adults | 175 |
| 269 | True | TRAPPIST-1e | False | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | F | 1 | Mars | 1 | 0.0 | 0.0 | Young adults | 174 |
| 289 | True | TRAPPIST-1e | False | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | G | 1 | Earth | 0 | 0.0 | 0.0 | Young adults | 122 |
| 169 | True | PSO J318.5-22 | False | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | G | 1 | Earth | 1 | 0.0 | 0.0 | Young adults | 109 |
| 291 | True | TRAPPIST-1e | False | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | G | 1 | Earth | 1 | 0.0 | 0.0 | Minor | 74 |
| 274 | True | TRAPPIST-1e | False | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | F | 2 | Mars | 1 | 0.0 | 0.0 | Young adults | 63 |
| 205 | True | TRAPPIST-1e | False | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | B | 2 | Europa | 1 | 0.0 | 0.0 | Young adults | 59 |
| 165 | True | PSO J318.5-22 | False | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | G | 1 | Earth | 0 | 0.0 | 0.0 | Young adults | 58 |
| 277 | True | TRAPPIST-1e | False | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | F | 3 | Mars | 1 | 0.0 | 0.0 | Minor | 53 |
| 139 | True | 55 Cancri e | False | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | G | 1 | Earth | 1 | 0.0 | 0.0 | Young adults | 52 |
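The ten patterns above already cover more rows than the 328 "Duplicate rows" reported in the overview, so that figure appears to count distinct repeated row patterns rather than the rows belonging to them. The sketch below illustrates the difference on a few toy rows (hypothetical, not taken from the dataset) and sums the "# duplicates" column.

```python
from collections import Counter

# Toy rows (hypothetical): (CryoSleep, Cabin_deck, HomePlanet).
rows = [
    (True, "G", "Earth"),
    (True, "G", "Earth"),
    (True, "G", "Earth"),
    (True, "F", "Mars"),
    (True, "F", "Mars"),
    (False, "B", "Europa"),
]

pattern_counts = Counter(rows)
duplicated = {row: c for row, c in pattern_counts.items() if c > 1}

n_duplicated_patterns = len(duplicated)          # distinct rows that repeat
n_rows_in_duplicates = sum(duplicated.values())  # all rows belonging to them

# "# duplicates" column of the ten most frequent patterns in the table above.
top10 = [175, 174, 122, 109, 74, 63, 59, 58, 53, 52]

print("Distinct duplicated patterns:", n_duplicated_patterns)
print("Rows belonging to those patterns:", n_rows_in_duplicates)
print("Rows covered by the ten most frequent patterns:", sum(top10))
```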